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REMARKS 

In paragraph 3, on page 2 of the Office Action, the 
Examiner rejects Claims 85-106 under 35 U.S.C. § 112, first 
paragraph . 

Specifically, the Examiner states that the recitation "at 
least about 10 nucleotides in length" is not supported in the 
specification, i.e., the Examiner contends that the 
specification (see page 10, and original Claim 5) only supports 
"about 10 to about 20 nucleotides". 

For the following reasons, Applicants respectfully traverse 
the Examiner's rejection. 

The test for sufficiency of support in an application is 
whether the disclosure of the application relied upon "reasonably 
conveys to the artisan that the inventor had possession at that 
time of the later claimed subject matter" (In re Kaslow, 
217 USPQ 1089 (Fed. Cir. 1983)). 

As stated by the Federal Circuit in Ralston Purina Company 
v. Far-Mar-Co. , Inc., 227 USPQ 177 (Fed. Cir. 1985): 

Far-Mar-Co cites several range cases to support its 
argument that ranges found in the applicant's claim 
language must correspond exactly to ranges disclosed in 
the parent. These cases are not on point. The facts in 
these cases precluded a determination that one skilled 
in the art could derive the claim limitations from the 
parent, due to a number of different factors, e.g., the 
unpredictable nature of the art, In re Sichert, 
566 F.2d 1154, 196 USPQ 209 (CCPA 1977); failure to 
distinguish one process from another, In re MacLean, 
454 F.2d 756, 172 USPQ 494 (CCPA 1972); the addition of 
a critical limitation, In re Blaser, 556 F.2d 534, 
194 USPQ 122 (CCPA 1977); failure to define a critical 
term, In re Lukach, 442 F.2d 967, 169 USPQ 795 (CCPA 
1971) ; and use of a list that did not contain the 
claimed substance. In re Ahlbrecht, 435 F.2d 908, 
168 USPQ 293 (CCPA 1971) . In addition, a predecessor to 
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this court has held "that a claim may be broader than 
the specific embodiment disclosed in a specification is 
in itself of no moment . " In re Rasmussen, 
650 F.2d 1212, 1215, 211 USPQ 323, 326 (CCPA 1981)... 

[T]he court did not clearly err in determining that the 
parent's disclosure adequately supported the water 
ranges of "at least about 25% by weight," and "at least 
25% by weight." [where the examples of the parent 
application showed a water content that equaled 25% 
and 27%] . (Emphasis added) 

There is no unpredictable nature with respect to the size of 
the oligonucleotide of the present invention, as it is well-known 
in the art that probes can be greater than 2 0 nucleotides. 
Further, the upper limit of the size of the probe is not relevant 
to distinguishing over the prior art, i.e., such is not a 
critical limitation or a critical term. 

The specification, at page 5, teaches that the nucleic acids 
may be variable in length, and that in one embodiment, they are 
from about 10 to about 2 0 nucleotides in length. Thus, the range 
of "about 10 to about 20" is merely an example of one embodiment. 

Clearly, the examples in the present application (see also, 
e.g., Claim 88) teach that the oligonucleotides can be 10 or 
greater nucleotides in length, and can be greater than 
20 nucleotides (e.g., positions 11821-11844 of SEQ ID N0:1 
(i.e., 24 amino acids) and positions 12945-12924 of SEQ ID NO:l 
(i.e., 22 amino acids)). 

Similarly, in In re Wertheim, 191 USPQ 90 
(CCPA 1976) the asserted claims covered a range ("solids level 
of at least 35%"), whereas the specification disclosed a range 
("concentrated . . . until a concentration of 25 to 60% solid 
matter is reached").. The court held that the claim found 
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written* description in the specification even though the claims 
was not repeated verbatim in the specification. 

Hence, it is clear that the specification clearly conveys 
to one skilled in the art "at least about 10" as recited in the 
claims . 

In addition, in paragraph 5, on page 4 of the Office 
Action, the Examiner states that the method of Claim 101 
requires one to perform hybridization reactions between a probe 
of undefined sequence to a target sequence found in any of a 
variety of "sugar-pathway genes". The Examiner contends that 
the specification fails to provide an adequate written 
description of the coding sequence for "sugar-pathway genes" as 
found in any of the encompassed microorganisms, nor an adequate 
written description of the probes and primers that would be used 
in the hybridization and detection aspects of the method. 

For the following reasons, Applicants respectfully traverse 
the Examiner's rejection. 

It is well-settled law that an application need not teach, 
and preferably omits, that which is well-known in the art 
{Hybritech Jnc. v. Monoclonal Antibodies, Inc., 231 USPQ 81 
(Fed. Cir. 1986) ) . 

Applicants respectfully submit that the sequences of 
oligonucleotide molecules which hybridize to the "sugar-pathway 
genes specific to the bacterial strains to be detected" were 
well-known to one skilled in the art, and/or ones skilled in the 
art would have been able to identify oligonucleotide molecules 
capable of hybridizing thereto without undue experimentation 
(see the last paragraph at page 6 of the present specification) . 
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Moreover, Applicants submitted with the Amendment After 
Final filed December 23, 2002, a list of citations dated prior 
to 1997 (Appendix B) and copies of related references 
(Appendix C) which disclose a range of genes associated with 
sugar biosynthetic pathways. These citations show that 

sugar-pathway genes were well-known in the art, and formed part 
of the common general knowledge. As a result, one skilled in 
the art would have been able to use her common general knowledge 
and routine techniques available in the art to produce 
oligonucleotide molecules capable of hybridizing to such genes. 

In the outstanding Office, the Examiner has apparently 
overlooked these references in maintaining the outstanding 
rejection. 

Applicants wish to add the following citation to those 

previously submitted: 

Keenylside et al "A Novel Pathway for O-Polysaccharide 
Biosynthesis in Salmonella enterica Serovar Borreze" , 
Journal of Biological Chemistry, Volumn 271, No. 45, 
pages 28581-28592 (1996); a copy of which is enclosed 
herewith. 

The above reference clearly shows that sugar-pathway genes 
were well-known in the art, and formed part of the common general 
knowledge . 

Applicants respectfully direct the Examiner's attention to 
the specific instances of the use of the term "pathway" in 
"Escherichia coli and Salmonella typhimurium: Cellular and 
Molecular Biology, Vol. 1", particularly at pages 504 and 507. At 
these sections, the use of the term "pathway" clearly relates to 
the biosynthetic processes involved in the synthesis of sugars and 

- 15 - 



AMENDMENT 

U.S. Appln. No. 09/423,093 

more complex carbohydrates, and that by the additional use of the 

terms "sugar" and "gene", there is explicitly described to the 

person skilled in the art a series of genes directly associated 

with these biosynthetic pathways. This is even more clearly 

described at pages 63 and 64 in "Physiology of the Bacterial Cell, 

A Molecular Approach" (Neiderhardt et al) . 

Furthermore, in the article by Keenylside et al, it is stated 

at page 28581, second column, last paragraph: 

Enzymes encoded by the rfjbb:54 cluster direct the 
synthesis of an N-acetylmannosamine (ManNAc) -containing 
O-polysaccharide with the novel disaccharide repeat 
unit .polysaccharide synthesis requires a functional 
chromosomal rfe gene. 

This paragraph also states that: 

Two LPS O- polysaccharide pathways have been described, 
and Rfe may play a role in either of these pathways . 
The 0:54 O-polysaccharide shares a number of 
characteristics with the O-polysaccharide synthesized by 
the Rfe- independent O- polysaccharide biosynthetic 
pathway. (Emphasis added) 

The above clearly describes sugar biosynthetic gene pathways. 

Accordingly, Applicants respectfully submit that the claims 
have adequate written description and are enabled by the 
specification, and thus request withdrawal of the Examiner's 
rejection. 

In paragraph 11, on page 4 of the Office Action, the 
Examiner rejects Claim 101 under 35 U.S.C. § 112, second 
paragraph. 
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Specifically, the Examiner contends that Claim 101 is 
indefinite with respect to which genes are considered to 
constitute "sugar-pathway genes". 

As discussed above, "sugar-pathway genes" were well-known 
to one skilled in the art, and thus this expression is not 
indefinite . 

Accordingly Applicants respectfully submit that the claims 
clearly and definitely recite the invention of interest. Thus, 
the Examiner is requested to withdraw the rejection. 

In view of the arguments set forth above, reexamination, 
reconsideration and allowance are respectfully requested. 

The Examiner is invited to contact the undersigned at his 
Washington telephone number on any questibn^l whie^ might arise. 



Respec£rul/ly submitted, 



GordOT Kit 

Registration No. 30,764 



SUGHRUE MION, PLLC 

Telephone: (2 02) 293-7 060 
Facsimile: (2 02) 293-7860 

WASHINGTON OFFICE 

23373 

CUSTOMER NUMBER 
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The plasmid-encoded gene cluster for 0:54 (^polysac- 
charide synthesis in Salmonella enterica serovar Bor- 
reze (rfboxJ contains three genes that direct synthesis 
of a ManNAc homopolymer with alternating 01,3 and 
01,4 linkages. In Escherichia coli K-12, RfbAo^ adds the 
first ManNAc residue to the Rfe (UDP-GlcpNAc::undeca- 
prenylphosphate GlcpNAc-1 -phosphate transferase)- 
modified hpopolysaccharide core. Hydrophobic cluster 
analysis of RibA^^ indicates this protein belongs to the 
ExoU family of nonprocessive 0-glycosyltransferases. 
Two putative catalytic residues and a potential sub- 
strate-binding motif were identified in RfbAo^. Topo- 
logical analysis of RfbBo^ predicts four transmem- 
brane domains and a large central cytoplasmic domain. 
The latter shares homology with a similar domain in the 
processive 0-glycosyltransferases Cps3S of Streptococ- 
cus pneumoniae and HasA of Streptococcus pyogenes. 
Hydrophobic cluster analysis of RfbBo^ and Cps3S in- 
dicates both possess the structural features character- 
istic of the HasA family of processive 0-glycosyltrans- 
ferases. Four potential catalytic residues and a putative 
substrate-binding motif were identified in RfbB Q:54 . In 
brfb E. coli K-12, RfbAo^ and RfbB Q;54 direct synthesis 
of smooth 0:54 lipopolysaccharide, indicating that this 
O-polysaccharide involves a novel pathway for O-anti- 
gen transport. Based on sequence and structural conser- 
vation, 15 new ExoU-related and 17 new HasA-related 
transferases were identified. 



Lipopolysaccharide (LPS) 1 is a major component of the outer 
membrane of Gram-negative bacteria. The hydrophobic lipid A 
portion forms the outer leaflet of the outer membrane. In enteric 
bacteria, an O-polysaccharide is attached to lipid A via a short 
core oligosaccharide. The O-polysaccharide extends away from 
the cell surface to give a hydrophilic surface layer and represents 
a major surface antigen (O-antigen). It is characterized by a 
repeat unit structure, with different epitopes arising from vari- 
ations in the nature, order, and linkage of the sugar monomers in 
the O-repeat unit and from the addition of side branches and 
"decorations" such as acetyl, ketal, and glycosyl residues. 
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Members of the genus Salmonella are serologically diverse. 
Approximately 50 O-antigens are recognized in Salmonella, 
and this diversity, combined with variable flagellar antigens, 
has resulted in the identification of greater than 2,000 serolog- 
ically distinct Salmonella strains. Strains are assigned to se- 
rogroups based on shared expression of major O-antigen 
epitopes or factors (1). Synthesis of the O-antigen is directed by 
the products of the rfb gene cluster, and the structural diversity 
seen in the LPS O-antigens reflects variation in the rfb gene 
clusters; this variation is thought to have arisen from repeated 
lateral gene transfer and recombination events involving rfb 
genes. Selective pressure from the host immune response is 
proposed to be the driving force for the continued generation of 
antigenic diversity (2). 

Serogroup 0:54 is a heterogeneous group of 13 different 
Salmonella serovars. It is unique among Salmonella sero- 
groups in that expression of the 0:54 antigen requires the 
presence of a small plasmid. This is the only known plasmid- 
encoded O-polysaccharide in Salmonella. We have studied the 
6.9-kb plasmid pWQ799 from serovar Borreze to determine the 
role of these plasmids in expression of the 0:54 antigen (3). 
pWQ799 is a novel ColEl-related plasmid carrying the entire 
rfb Q:&4 biosynthetic cluster (4). This is the only reported exam- 
ple of a ColEl-related plasmid carrying genes for the synthesis 
of cell-surface antigens. The plasmid is mobilized in the pres- 
ence of an appropriate helper plasmid, providing the first de- 
fined mechanism for lateral gene transfer of O-antigen biosyn- 
thesis genes in Salmonella enterica. Mobilization of plasmids 
containing rfb 0:64 into strains possessing a chromosomally en- 
coded rfb explains the simultaneous expression of two distinct 
O-antigens in members of this serogroup (3) as well as the 
serotyping results of plasmid-cured 0:54 derivatives. 

Enzymes encoded by the rfb 0:&4 cluster direct the synthesis 
of an iV-acetylmannosamine (ManNAc)-containing O-polysac- 
charide with the novel disaccharide repeat unit, -»4)-/3-D-Man- 
pNAc-(l->3)-/3-r>-ManpNAc-(l— 0:54 polysaccharide synthe- 
sis requires a functional chromosomal rfe gene (3). Rfe 
transfers GlcpNAc-l-phosphate to undecaprenol-phosphate to 
form the first lipid intermediate (lipid I) in synthesis of the 
enterobacterial common antigen (5). Two LPS O-polysaccha- 
ride pathways have been described, and Rfe may play a role in 
either of these two pathways (reviewed in Ref. 6). The 0:54 
O-polysaccharide shares a number of characteristics with the 
O-polysaccharides synthesized by the Rfc-independent O-po- 
lysaccharide biosynthetic pathway including the following: 1) it 
is a homopolymer, and 2) Rfe is required for synthesis although 
GlcNAc does not form part of the repeating unit. The Rfc- 
independent pathway has so far only been identified for a few 
homopolymeric O-polysaccharides in Klebsiella 01 (7, 8), Esch- 
erichia coli 08 and 09 (9-11), and Serratia marcescens 016 
(12). In this pathway, Rfe initiates synthesis by transferring 
GlcpNAc-l-P to the carrier lipid to form the acceptor upon 
which the complete O-polysaccharide chain is subsequently 
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polymerized. GlcpNAc therefore does not form part of the re- 
peating unit. O-polysaccharides synthesized by this pathway 
are polymerized in the cytoplasm (13), and then the completed 
polymer is transported across the plasma membrane by an 
r/b-encoded dedicated ABC (ATP-binding cassette) transporter, 
prior to ligation to lipid A core. Analysis of rfb 0:64 and 0:54 
expression indicates that transport of the nascent 6:54 polysac- 
charide occurs by a different mechanism. This communication 
describes the organization of rfb 0:64 and the characterization of 
the two glycosyltransferases that direct the synthesis of the 
0:54 O-polysaccharide. Our studies indicate that synthesis of 
0:54 LPS follows a novel mechanism distinct from either of the 
two characterized O-polysaccharide pathways. 

EXPERIMENTAL PROCEDURES 

Bacterial Strains, Growth Conditions, and Plasmids— Bacterial 
strains were grown in Luria-Bertani broth supplemented, where appro- 
priate, with antibiotics (ampicillin, 100 jig/ml; kanamycin, 50 /xg/ml; 
chloramphenicol, 30 /Ag/ml; tetracycline, 15 jig/ml). For high level ex- 
pression of RfbAo. M and RfbB^^, 100 mM ifiopropy l-/3-D-gal actopyrano- 
side was added to LB broth cultures at a final concentration of 0.5 mM. 
TvphoA mutants were examined on LB agar plates supplemented with 
40 /ig/ml 5-bromc~4-chloro-3-indolyl phosphate toluidine salt. The host 
for TnphoA mutagenesis was E. coli CS118 (14), genotype A(ara leu) 
7697 MacX74 *phoA20 galE galK thi rpsE rpoB argE(Am) recAL LPS 
expression was examined in E, coli K-12 strains DH5a (15), S4&74 (16) 
or 21548 (17). The latter two strains have the following genotypes: 
Stf>874, lacZ trp bsbcB-rfb upp rel rpsL; 21548, thr-1 leuB6 Mgpt- 
proAm hisG4 argE3 thi-1 rfbDl lacYl ara-14galK2 xyl-5 mtl-1 mgU51 
rpsL31 kdgKSl supE44 rfe::Tn!0-48. The plasmids used in this study 
are shown in Fig. 1. The construction of plasmids pWQ800 and pWQ802 
has been previously reported (3, 4). pWQ819 contains a 2.3-kb EcoRl- 
Kpnl fragment from pWQ799 cloned in pTrc99A (Pharmacia Biotech 
Inc.). pWQ820 was constructed by cloning the 2.4-kb Mlul fragment 
from pWQ799, after filling in the ends, into the Smal site of pKK232-8 
(Pharmacia), and then removing the insert using the flanking EcoKl 
and BamHl sites from the vector multiple cloning region. This fragment 
was cloned into EcoRI-BamHI-digested pGEM-7Zft+) (Promega) to 
generate pWQ820. pWQ822 contains the 1.14-kb ifmcII-SspI fragment 
of pWQ799 inserted at the Smal site of the expression vector pKK232-8. 
Plasmid pWQ823 was constructed by first cloning the 1.14-kb ffmcll- 
Sspl fragment into the EcoRV site of pGEM-5Zf(+) (Promega) so that 
the direction of transcription was toward the SP6 promoter. The insert 
was then removed using the Pstl and Ncol sites of the vector and 
cloning this into NcoLPstl-digested pTrc99A. 

DNA Manipulation and Analysis — Restriction enzyme digestions, 
ligations, and CaCls transformations were performed as described by 
Sambrook et al. (15). DNA fragments to be subcloned were gel-purified 
using the GENECLEAN kit (Bio/Can Scientific, Mississauga, Ontario, 
Canada). Plasmids were column purified using QIAGEN spin columns 
(QIAGEN Inc., Chatsworth, Calif.) according to the manufacturer's 
instructions. 

Nucleotide sequencing procedures are described elsewhere (4). Se- 
quence data were edited and analyzed using AssemblyLIGN and 
MacVector software (International Biotechnologies Inc., New Haven, 
CT). Hydropathic analysis of the predicted protein sequence was done 
using the algorithm of Rao and Argos (18) with a minimum length for 
the transmembrane helix of 16. Homology searches of nucleotide and 
amino acid sequences in the National Center for Biotechnology Infor- 
mation data bases were done with the BLAST (basic local alignment 
search tool) server analysis program (19). Multiple alignments were 
performed using Clustal (20). Hydrophobic cluster analysis (HCA) was 
performed using the HCA-Plot program (Doriane Informatique, Le 
Chesnay, France). This program writes protein sequences on a dupli- 
cated a-helical net and circles clusters of hydrophobic amino acids (Ala, 
Val, Leu, lie, Met, Phe, Gin). The plots are then visually compared for 
similarity in the hydrophobic cluster patterns, limiting analysis to the 
predicted globular portions of the proteins. 0-Strands and a-helices are 
deduced based on the observed association of specific hydrophobic clus- 
ter shapes with secondary structures (21). 

TnphoA Mutagenesis of Plasmid-encoded rfb Q:S4 Determinants— 
Plasmid-encoded rfb oli4 genes were mutagenized using TnphoA as de- 
scribed previously by Manoil and Beckwith (14) . Fusions were made in 
plasmid pWQ800 (3). The precise site of insertion was mapped by 
sequencing out of the phoA orf using a primer (5'-CAGTAATATCGC- 
CCTGAGCA-3') that is complementary to the phoA sequence between 



nucleotides 79 and 98 (22). 

RNA Isolation and Primer Extension — RNA was purified from a 
mid-log phase culture of E. coli DH5a (pWQ820) using TRIzol Reagent 
(Life Technologies, Inc.) according to the manufacturer's instructions. 
Primer extension reactions were done using the oligonucleotide primer 
TS154 (5'-TTTCATAATGTCGATCTGTTAATCC-3' ) that corresponds 
to the complementary sequence to nucleotides 3676-3700 of plasmid 
pWQ799. The primer was end-labeled with [y 32 P]ATP (DuPont NEN) 
and T4 polynucleotide kinase (Boehringer Mannheim, Laval, Quebec, 
Canada) and then purified using a QIAquick spin column (QIAGEN 
Inc., Chatsworth, CA). Primer extension experiments were performed 
using the First-Strand cDNA Synthesis Kit from Pharmacia (Pharma- 
cia Biotech, Baie DTJrfe, Quebec) following the manufacturer's recom- 
mendations. DNA sequencing was done with the Sequenase version 2.0 
sequencing kit (U. S. Biochemical Corp.). 

LPS Extraction and Purification of Core Oligosaccharides — LPS 
samples were prepared either from SDS-proteinase K whole cell ly- 
sates, as described by Hitchcock and Brown (23), or by using a modifi- 
cation of the phenol/water extraction method; LPS was collected from 
both the aqueous and phenol phases (3). For compositional analysis of 
the LPS core oligosaccharides, lipid A was removed from phenol-puri- 
fied LPS by hydrolysis in 1.5% acetic acid at 100 °C for 2 h. Precipitated 
lipid A was removed by centrifugation, and the supernatant was lyoph- 
iUzed. Samples were resuspended in water to a final concentration of 5 
mg/ml, hydroiyzed, and analyzed by high performance anion-exchange 
chromatography as described previously (24). 

TricineSDS-PAGE of LPS— LPS samples were analyzed by SDS- 
PAGE using commercially prepared 10-20% gradient Tricine gels from 
Novex (San Diego, CA). Electrophoresis conditions were those specified 
by the manufacturer. Gels were silver-stained using the method of Tsai 
and Frasch (25). 

RESULTS 

Organization ofrfb 0:S4 — Previous analysis of the nucleotide 
sequence and genetic organization of pWQ799 indicated that 
approximately half of the 6915-base pair plasmid (nucleotides 
72-3384) is involved in plasmid replication and mobilization 
(4). These regions are related to ColEl and possess an average 
G + C content of 50-53% (Fig. 1A). This value is typical of 
Salmonella genomic DNA (26). In contrast, the remaining 
pWQ799 sequences have a uniformly lower average G + C 
content of 39%, with no detectable homology to any known 
ColEl-related sequences. The junctions between the high and 
low G + C regions in pWQ799 coincide with the 5' and 3' ends 
of the plasmid replicon regions. Abnormally low G + C values 
relative to those typical for the species is a common observation 
for genes involved in polysaccharide synthesis (27). These ob- 
servations therefore suggested that the remaining DNA con- 
tained rfb 0 54 determinants. 

Computer analysis for coding regions combined with se- 
quence homology searches in the NCBI data bases (see below) 
resulted in the identification of three potential open reading 
frames (Fig. IB; nucleotides 3547-4364, 4364-5740, and 
5740-6873) designated rfbAo :&4) rfbB Q:64 , and rfbC 0:64 . Each 
gene is preceded by potential Shine-Dalgarno sequence: AGGA 
at nucleotides 3532-3535; AAGA at 4356-4359, and AGGA at 
5727-5730. The initiation codons of rfbB 0:64 and rfbC Q:S4i each 
overlap with the end of the preceding gene, suggesting that 
translation of the three genes is coupled. No consensus - 10 
and -35 E. coli promoter sequences were detected in the region 
immediately upstream of r/5Ao :64 . To localize the rfb 064 pro- 
moter, the 1.14-kb Hincll-Sspl fragment of pWQ799, contain- 
ing rfbA 0:64 and a 90-base pair 5 '-flanking region, was cloned 
in front of the promoterless cat gene of pKK232-8. When 
transformed into E. coli K-12, the resulting plasmid (pWQ822; 
Fig. lO conferred chloramphenicol resistance, indicating the 
presence of an endogenous promoter. The transcription start 
site was mapped by primer extension of the oligonucleotide 
TS154 using total cellular RNA from E. coli DH5a (pWQ820; 
Pig. lO. A single band was obtained in the resulting autora- 
diogram (data not shown), identifying the start site as nucleo- 
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Fig. 1. Structure and organization of pWQ799 and the r/&o*4 cluster. A, base composition plot of pWQ799. The average % G + C for the 
plasmid replicon sequences (nucleotides 1-3384) is 54%. The arrow indicates the beginning of rfb 0:64 , where the % G + C drops to 39%. B, 
organization of pWQ799. The size, location, and direction of transcription of the rfb 0:M orfs are indicated by the open boxes under the stippled region 
of the map. The functional regions of the pWQ799 replicon are presented for clarity; ori represents the origin of replication; bom is the origin of 
transfer; mob is the mobilization region. The flags represent the site of TnphoA fusions; the open flag indicates PhoA~, and the filled flag indicates 
PhoA*. C t physical maps of p las raids used in this study. For simplicity, only the insert DNA is shown. The inserts in plasmids pWQ819 and 
pWQ823 are under the control of the trc promoter of vector pTrc99A Expression of the insert in pWQ820 may be driven by the T7 promoter of vector 
pGEM-7Zfl>). The presence of the endogenous r/5 0:54 promoter in plasmid pWQ822 is indicated by expression of the promoterless cat gene in the 
vector pKK232-8. aph denotes the acetyltransferase gene that confers kanamycin resistance. 



tide G at position 3474, 73 nucleotides upstream of the rfbA^y ^ 
start codon. 

Previous studies of 0:54 O-polysaccharide expression in 
strains containing pWQ800 (Fig. 1C) revealed a requirement 
for a functional rffE determinant in an E. coli host (3). This 
gene is part of the chromosomal enterobacterial common anti- 
gen (ECA) biosynthesis cluster and encodes the enzyme UDP- 
GlcNAc-2-epimerase (EC 5.1.3.14, UDP-N-acetyl-D-glucosa- 
mine 2-epimerase) (5). The epimerase converts UDP-GlcNAc to 
UDP-ManNAc, an intermediate in the formation of a precursor 
for ECA biosynthesis. UDP-ManNAc is also the presumed pre- 
cursor for 0:54 synthesis. Sequence analysis of pWQ799 indi- 
cated that the EcoRl site used in cloning of pWQ800 maps 
within rfbC 0:54 . Subsequent mutant complementation analysis 
with an rfbC 0:64 + subclone has shown that RfbC 0:54 is a func- 
tional homologue of RffE and is therefore not essential for 0:54 
synthesis in members of the Enterobacteriaceae, all of which 
synthesize ECA (28). The remaining two rfb 0:64 gene products, 
RfbAo-54 and RfbB 0:M , are unique and essential for 0:54 po- 
lysaccharide assembly and their structures and functions are 
described below. 

Structural Analysis of RfbAo :54 — Sequence analysis of rf- 
feAo-54 predicts a 274 amino acid protein with a molecular mass 
of 31.8 kDa and a pi of 9.76. Hydropathic analyses (18) iden- 
tified one transmembrane helix at the carboxyl terminus of the 
protein (residues 227-251). To determine the topology of Rf- 
bAo :M , a promoterless leader sequence-deficient TnphoA gene 



was inserted in-frame at a position corresponding to amino acid 
21 of R£bAo :64 (Fig. IB). E. coli CC118 (pWQ800B6) was 0:54- 
deficient and PhoA-negative on indicator media, indicating 
that this region of the protein is present in the cytoplasm. This 
cytoplasmic location is in agreement with the function of Rf- 
bAo^ (see below) and with protein topology predictions by the 
positive-inside rule. This rule allows the prediction of the to- 
pology of a bacterial inner membrane protein based on the 
observation that positively charged amino acids (Arg + Lys) 
are more abundant in cytoplasmic loops (i.e. —15%) than in 
periplasmic loops (-5%) (29). The Arg + Lys content of this 
region was 14%. 

The translated amino acid sequence of RfbAo^ shares sig- 
nificant homology with a number of putative bacterial glyco- 
syltransferases (Fig. 2 and Table I). RfbAo :64 shares 32% iden- 
tity with the predicted product of the r/Z>Ec07 & ene orf275 (30), 
26% identity with the hypothetical protein 6 of the Isg locus of 
Haemophilus influenzae, and 25% identity with the AmsE pro- 
tein of Erwinia amylovora (31). Analysis of the protein align- 
ments of these predicted products revealed that the sequence 
conservation was relatively uniform throughout the length of 
the proteins (Fig. 2). A number of protein sequences in the data 
bases were also identified with significant levels of homology 
over the N-terminal 192 amino acids of RfbAo :64 . Alignment of 
this region gave identity levels of 21 and 20%, respectively, 
with the ExoU and ExoO proteins from Rhizobium meliloti 
(32-34) and 21% with the LgtA glycosyltransferase from Neis- 
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RfbA M^QFTVCMWTKNDAAEHFN^^ IANIG 

ArasE MFSVLISLY^ttEKP^2^LBQCLESIi^QQ^^ 

Orf275 MSDDTWFSVIJlAIYlia)SPLFL^^ 

Leg HKFSVI*ELYIKQ^PIJ?ECFESLVA^^ 



Fig. 2. Multiple sequence align- 
ment of RlbAo^ and homologous 
proteins. Complete protein sequences 
are shown. References and accession 
numbers for the aligned proteins are 
listed in Table I. Identical amino acids are 
indicated by asterisks; similar amino ac- 
ids are indicated by the dots. 



RftA 
ArasE 
orf275 
Lsg 



RfbA 
AmsE 
or f 27 5 
Lsg 



RfbA 

AmsE 

orf275 

Lsg 



DELSI RHJUAHLSSVGQARNIAIilAAJW^ PELN — DSVR 

NIIfLvTVPI£KNIXajSKAtt^^ 

TKTJ>LKLVKLPQplRGIiC3CAI^EGr^ 



HVGAWVA^SREraiDRCSVRRVPCEFVDimSR 
LSGAAVTEFDEHOT--RIXRLK£^IHE^^ 
ILSSWVEHTOWPGDKGIIKKVPSR^ 
IFQQGTAEFGIWVNDIW 



UtftiEDYULVTOFIRNSKIIRNI 

L-YMEX>YNLWLRIMSIiGHPVANLPt^ — RRGVWYIKSEVQLYRLKLA 

Y -LYEDYALtiaJ^IANSa^aCNI QQVLVEMRPSKETAKR- - RGGIKYAISEIKAQYHFYR 
--LQEDYYI^KLVRQGLYMANL^^ 



RftA NRYISLUDFFMIVriKILVRLLP^^ 

AmsE LKQTGFIRGTLYFLI RTWITUiCVKVMQIFLYEKDRKG- - 

or f 27 5 ANYISYQDFIINIITOIFV^uXTOFRCyiYKKWRilFL 

Lsg I/5IQCLLSGIJ^AIJ*K^RIXP^ - 



Table I 

Members of the ExoU family of nonprocessive fi-gly cosy lira nsf erases 



Protein 



Putative or known activity 



Organism 


Accession no. 


Ref. 


S. enterica sv. Borreze 


L39794 


This study 


E. coli 07 


L04596 


30 


Erwinia amylovora 


X77921 


31 


E. amylovora 


X77921 


31 


Haemophilus influenzae 


M94855 


Unpublished 


Neisseria gonorrhoeae 


U14554 


35 


N. gonorrhoeae 


U15992 


35 


N. gonorrhoeae 


U15992 


74 


H. influenzae 


L46209, L42023 


75 


H. influenzae 


L45506, L42023 


75 


Shigella dysenteriae 


S73325 


76 


Yersinia enterocolitica 03 


Z47767, X63827 


77 


Bacillus subtilis 168 


U13979 


Unpublished 


Staphylococcus aureus Newman 


1340131 


Unpublished 


Streptococcus thermophilus Sfi6 


U40830 


78 


It meliloti 


L20758 


33,34 


R. meliloti 


L20758 


33,34 


R. meliloti 


L20768 


33,34 


R. meliloti 


L20758 


33,34 


R. meliloti 


L20758 


33,34 


S. cerevisiae 


J04184 


54 


Anabaena Bp. strain PCC 7120 


M31722 


79 



RfbAo.54 Adds first ManNAc residue during synthesis of the 0:54 

O-polysaccharide 

Or£275 Required for synthesis of the 0-7 O-polysaccharide 

AmsE Required for synthesis of amylovoran 

AmsB Required for synthesis of amylovoran 

Lsg6 Involved in LOS synthesis 

LgtA Adds GlcNAc 01-+3 to Gal; Involved in LOS synthesis 

LgtD Adds GalNAc 01->3 to Gal; involved in LOS synthesis 

Lsi-2 Adds GlcNAc 01->3 to Gal; involved in LOS synthesis; 

homologue of LgtA 

LgtD Putative glycosyltransferase involved in LOS synthesis 

LgtD Putative glycosyltransferase involved in LOS synthesis 

RfpA Involved in synthesis of type 1 O-polysaccharide 

TrsC Putative glycosyltransferase involved in outer core LPS 

synthesis 

GgaB Involved in secondary teichoic acid synthesis 

Orf4 Involved in staphyloxanthin synthesis 

EpsI Putative glycosyltransferase 

ExoU* Adds Glc 01— >6 to Glc; involved in succinoglycan synthesis 

ExqO° Adds Glc 01-*6 to Glc; involved in succinoglycan synthesis 

ExoA a Adds Glc 01-»3 to Gal; involved in succinoglycan synthesis 

ExoM° Adds Glc 01-»4 to acetylated Glc; involved in 

succinoglycan synthesis 

ExoW° Adds Glc 01-*3 to Gal; involved in succinoglycan synthesis 

Dpml° Dolichol mannose phosphate synthetase 

Orf2° Unknown 



a Original member of the ExoU family, as defined by Saxena et al. (36). 



seria gonorrhoeae (35) (Table I). 

Previous studies of glycosyltransferases have demonstrated 
that there is often insufficient sequence similarity for func- 
tional predictions using traditional sequence alignments (36). 
However, transferases which catalyze the formation of glyco- 
side linkages with the same stereochemistry and with struc- 
turally related substrates are predicted to share a similar 
three-dimensional architecture in the catalytic and binding 
domains. This would be reflected in the presence of conserved 
structural regions or sequence motifs for shared mechanistic 
functions. Such domains can be identified using HCA (36, 37). 
This method plots the two-dimensional pattern of protein se- 
quences and allows visual comparison and detection of con- 
served structural features. Using HCA, Saxena et al. (36) com- 
pared the two-dimensional structure of five different 
glycosyltransferases of known catalytic functions, including 
ExoO and ExoU from Rhizobium meliloti. They identified a 



structural region, domain A, that was present in all five pro- 
teins. The transferases all possess a common catalytic activity: 
formation of a single glycosidic linkage with a ^-configuration 
from a-linked nucleotide diphospho sugar donors. Domain A is 
therefore speculated to be directly involved in this shared ac- 
tivity. Since the region of similarity detected between RfbAo :54 , 
ExoU, and ExoO includes the region containing domain A in 
the Rhizobium proteins, HCA was used to compare RfbAo.^ 
with ExoU and the other RfbAo :54 homologous proteins (Fig. 3). 
All of the proteins examined possessed the domain identified by 
Saxena et al. This domain is characterized by a series of verti- 
cal hydrophobic clusters typical of 0-strands alternating with 
clusters characteristic of a-helices (36). Two conserved motifs 
were identified within the regions analyzed: (DXDD), at the 
C-terminal end of the 04 region of domain A, and EDY, which 
lies an average of 94 amino acids downstream of the (DXDD) 
motif, within another region of structural conservation. The 
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domain A 



ExoU 



RfbA^ 



Orf275 



Lsg 



AmsE 




10 20 30 40 50 

-J — i l i l 



70 SO 90 100 110 120 130 140 150 160 170 160 190 200 
I I I I I I I I I I I I I l_ 



amino acid residue 

Fig. 3. Alignment of HCA plots of ExoU, RfbAo^, and RfbAo^-related proteins. Hydrophobic cluster analysis was performed using the 
HCA-Plot program (Doriane Inform atique, Le Chesnay, France). This program writes protein sequences on a duplicated a-helical net and circles 
clusters of hydrophobic amino acids (Ala, Val, Leu, He, Met, Phe, Gin). The plots are then visually compared for similarity in the hydrophobic 
cluster patterns, limi tin g analysis to the predicted globular portions of the proteins. Plots were aligned using the results of amino acid sequence 
alignments as a starting point. Hydrophobic clusters with obvious similarities were used as anchors for the structural alignment, as were regions 
containing glycines ( ♦ ) and prolines (*), which are often present in loops (21). Vertical lines were drawn to indicate structurally conserved features. 
The prediction of 0-strands and a-helices is based on the observed association of specific hydrophobic cluster shapes with secondary structures (21). 
Amino acid denoted with one-letter code except for proline (*), glycine ( ♦ ), serine (ED, and threonine (□). Conserved residues are circled. Regions 
ir-iv indicate structural regions that appear to have a conserved two-dimensional architecture surrounding the (EDY) motif; these regions were not 
described in the initial characterization of the ExoU family. 



first motif includes a conserved Asp residue previously identi- 
fied by Saxena's group. This residue falls in a loop following a 
0-sheet, and based on this conserved location and the acidic 
nature of the residue, Saxena et a/, have speculated that this 
may be a catalytic residue. As noted by Saxena et al. (36), the 
proteins were also found to contain at least one Asp residue in 
the loop at the C-terminal end of the 02 strand (Fig. 3). The 



other protein sequences obtained from the data base search 
with RfbAo :64 were also examined for the presence of these 
conserved features (Fig. 4). A total of 15 proteins, in addition to 
the 7 originally described by Saxena et al. (36), were identified 
as being members of the ExoU family (Table I). 

RfbA Q:B4 Is a Monofunctional N-Acetylmannosaminyltrans- 
ferase — The observed sequence and structural homology of Rf- 
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Catalytic site 1 



ExoU 36 A-KVWID-§GSTDDSASVA 

RfbA 34 PSBVLIV- -jf^NGGLTDRHYE 

ExoO 39 TVBWWD-icSADATPALV 

ExoM 37 RLR V I VADNSAE P S ARAL VE 

ExoW 32 DFHVLVID-j^ESPYPIADEL 

ExoA 38 ItNARWIA-jOGGSTDGTREI 

AmsE 31 ADEIVLVY-DGPVSESLKAV 

orf275 37 PDEVIIIR-ioKVTSELNSV 

Lsg 32 ADEIVLVF-goWTPDLEFV 

Dpml 36 KTEL I FVD - gNSQDGS VEEV 



62 RLNWRFEENRGPi 

63 RLLRIAHLSSV- -I 
66 RLIALDRNRGP-- 
66 ILYVHCPHSNI - - S I! 
60 RITVIRQPNGGP 
66 RVLFLDNPKRIQ-a 

60 VIVPLEKNLGL--G] 
66 KDFTLEKNMGL- - 

61 KLVKLPQNRGL- -G 

65 R I I VRTNERGL - - SS 



Catalytic site 2 
* *# 

[HAIA- -ISHSPLIGVL^A DlPFFPGRLGQL 

[IALH- - AAR YQ V I AVADP DllNEPDRFIKL 

lGIG- -aargrwiavlds D§TVRPDRLRRM 

fCCLD- -NSTGDFLAFLi DpETVSGDWLTR 

jldnvpadsdfvaflSs dSvwtpdhllna 

lVaelgagsdyl i r idahgtypdSycerl VEDAL 

HE - -RCTHNWARMgT uflCLPERFEKQ 

JLN- -QCMHDLVIRAgS DflNRTNRFECI 

:oll- -hcdydwvfrmIt dId I CVPDRFEKQ 

jjVLKGFY- - EAKGQ YLVCMSA |LQHPPETVPK 




Binding site l 



ExoU 

RfbA 

ExoO 

ExoM 

ExoW 

ExoA 

AmsE 

orf275 

Lsg 

Dpml 



190 
178 
192 
184 
176 
180 
166 
173 
169 

168 



LG 

FG - 

IG - 

GG ■ 

IGRKLFEKVRFEATLK 

VG GYDESFSH 

AG SYQHHLY 

VG --GYGNEYLY 

COG ■ 



oydlyaralangaryki ihscgyaawrgnslsgshrtidlkgly 

dlvtrfirngki irnipivmvnfrig dakklykkrgg 

| y i llas alacggalrgravrrlhls yprg fhlacvearshrpdd 
Itdfftg^caggtiafspeawvhepvpenraslawlakrrfrsg 
|v - lffcdcvlas kr wlcdaagavrgegln i fhs i dnds pqflk 
§aeldyrix5kaqraiwmtdktsmvyypraklvplfwqyfgyg|-g 

gynlwlrimslghpvanlpdvlmkvrag sdmvnkrrg 

^ stalwlkslangcngdniqqvlvdmrfs ketakirg 

- yedlo^yylwi klvaqgl ymanl pd i lvyarvg ngmvsrrrg 

YLENCNPRDINSO^FKIALELLAKLPLPRDPRVAIGEVPFTFGVRTEGESKLSGKVI iqylqqlk 



Fig. 4. Multiple sequence alignment of conserved structural regions in ExoU, RfbA^^, and related proteins. Conserved residues are 
indicated by boldface. Shaded boxes denote residues that are 1009b conserved. Putative catalytic and binding sites are indicated above the 
alignment and were identified based on their highly conserved location and shared features with catalytic and binding sites of inverting 
glycosidases. 



hAo^ with other glycosyltransferases suggested that rfbAQ :B4 
encodes a ManNAc transferase (EC 2.4.1 JO. However, two 
separate ManNAc transferase activities are required for the 
generation of a ManNAc homopolymer with alternating 01-*3 
and 01— >4 linkages. In other homopolysaccharide O-antigens 
where polymerization is initiated by Rfe, the transferase which 
adds the first residue to the Rfe intermediate is distinct from 
the enzymes which subsequently polymerize the O-polysaccha- 
ride (8, 9). A third transferase activity is therefore predicted for 
the transfer of the first ManNAc residue to undecaprenol-P-P- 
GlcNAc, the product of the Rfe reaction (5). To determine which 
^0:64 S ene product was responsible for the initial ManNAc 
transfer, plasmids carrying either rfbAo-^ (pWQ823; Fig. lO 
or rfbB Q:64 (pWQ819; Fig. lO were transformed into different 
E. coli K-12 backgrounds, and LPS in the whole cell lysates of 
the transformants was analyzed by SDS-PAGE (Fig. 5). In E. 
coli DH5a (rfe+rffE + ), neither gene was sufficient for 0:54 
synthesis and expression of a ladder of O-antigen-substituted 
LPS. Introduction of the two genes together on plasmid 
pWQ802, either in strain DH5a or the ^b-delete strain S<f>874, 
was sufficient for expression of authentic 0:54 O-polysaccha- 
ride. These results were confirmed by Western immunoblot 
using absorbed polyclonal 0:54 antisera (data not shown). 
Analysis of the LPS core regions in SDS-PAGE profiles re- 
vealed that in the presence of rfbAQ :64 , an additional LPS band 
was synthesized; this band migrated slightly slower than the 
lipid A core fraction of the host strain. In contrast, LPS from 
the r/6B 0;M -containing strain was indistinguishable from that 
of the parental strain. Further analysis determined that the 
RfhAo^-mediated band was not synthesized in an rfe~ host 
strain, as would be expected for a biosynthetic pathway initi- 
ated by Rfe (Fig. 6A). The structure of the E. coli K-12 core 
oligosaccharide has been determined (38, 39) and does not 
include any ManNAc residues. The demonstration of ManNAc 
in the RfbAo :64 -modified core would therefore be indicative of a 
ManNAc transferase function catalyzed by RfbAo ;M . Purified 




Fig. 5. Analysis of the effect of RfbAo^ and RfbBo^ on LPS in 
whole cell lysates. Silver-stained SDS-PAGE from LPS of strains 
containing either rfbAo^ (pWQ823), rfbB Q M (pWQ819), or rfbAo.^ 
and rfbBo. M (pWQ802). 

core oligosaccharide from E. coli DH5a (pWQ823) was there- 
fore analyzed by high performance anion-exchange chromatog- 
raphy. A single additional peak corresponding to man- 
nosamine, the acid-hydrolyzed product of ManNAc, was 
detected in the chromatogram of the RfbAo :54 -dependent core 
oligosaccharide (data not shown). 

To determine the size of the RfbAo ^-dependent band, the 
LPS from E. coli DH5a (pWQ823) was analyzed by SDS-PAGE 
alongside two LPS samples which each contain a modified core 
fraction of known size and composition (Fig. 6B). Plasmid 
pJK2363 contains the Shigella dysenteriae galactopyranosyl- 
transferase gene, rfpB (40). Rfe + K-12 host strains expressing 
rfpB synthesize a core oligosaccharide modified by the addition 
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Fig. 6. SDS-PAGE analysis of the RibA^-mediated core mod- 
ification in whole cell lysates of different E. coli K-12 host 
strains. A, analysis of the effect of host functions on RfhA^^-depend- 
ent core modification. Strains and plasmids are indicated above each 
lane. B, determination of the size of RfbA^^-modified core. In E. coli 
DH5a, plasmid pWQ20 directs the synthesis of a core fraction modified 
by the addition of three sugars (core +3); pJK2363 directs the formation 
of a core fraction modified by the addition of two sugars (core +2). 

of the disaccharide la-r>Ga!p-(l->3)-p-D-GlcpNAc-(l-»] (41). 
Plasmid pWQ20 contains the rfbFj^j^ gene from Klebsiella 
pneumoniae 01, and E. coli K-12 strains containing rfbFj^x 
produce a core oligosaccharide modified by the addition of the 
trisaccharide 1/3-D-Gal/ 7 -(1^3)-a-D-Ga]p-(l->3)-0-D-GlcpNAc- 
(1->J (8). The comigration of the RfbAo :54 -modified band with 
the *core + 2" band of E. coli K-12 (pJK2363), combined with 
the Rfe dependence of this band and chemical compositional 
analysis all indicate that the RAbA^^-dependent band repre- 
sents lipid A core modified by the addition of a single GlcNAc 
and a single ManNAc residue. 

The RfbAo ^-modified lipid A core fraction was not present 
in E. coli S<f>874 (pWQ823), indicating that synthesis of this 
LPS fraction requires one or more function(s) encoded by the 
K-12 rfb gene cluster (Fig. 6A). Expression of 0:54 LPS in 
S<f>874 (pWQ802) indicates that this requirement is overcome 
by the simultaneous activities of rfbA 0:&4 and rfbB Q:64 . Ligation 
of newly polymerized O-chains to lipid A core occurs at the 
periplasmic face of the plasma membrane and must therefore 
follow trans-plasma membrane transport of the undecaprenol- 
bound intermediate. A similar /y6 K _ 12 -dependent transport ac- 
tivity has been demonstrated for the RfbF Kp01 and RfpB-me- 
diated core modifications (8, 42). The E. coli K-12 O-antigen 
transporter, RfhX, is believed to be responsible for the trans- 
port of the lipid-bound R^Fr^ and RfrB products (8, 42, 43). 
Given the nature of this transport event and the involvement of 
both host and cloned plasmid functions, it was possible that the 
appearance of a single modified core band in E. coli DH5a 
(pWQ823) resulted from a substrate size limitation imposed by 
RfhX, rather than the monofunctional transferase activity of 
RfbAo :M . In this case, lipid-linked intermediates with higher 
degrees of polymerization would be synthesized but would re- 
main in the cytoplasm, attached to carrier lipid. To address this 
possibility, the O-polysaccharide ligase-deficient strain E. coli 
CS2334 was transformed with pWQ823. Under such conditions 
any O-polysaccharide that is formed, but not transported, 
should accumulate in the cytoplasm. E. coli CS2334 (pWQ823) 
was phenol-extracted, and because the polymeric 0:54 LPS 
partitions primarily into the organic phase (3), both phases 



were examined. The linkage between undecaprenol and carbo- 
hydrate polymer is phenol-labile (43) and, as a consequence, 
extracted O-haptenic material remains in the supernatant fol- 
lowing a 100,000 x g centrifugation step. Supernatants from 
both the aqueous and phenol phases were size-fractionated on 
a Sephadex G-50 column, and fractions containing amino sug- 
ars were analyzed by *H NMR. Although extracted high mo- 
lecular weight ECA was detected in these experiments, no 
ManNAc-containing polymer was present (data not shown). 
These data confirm that RfbAo.^ transfers a single ManNAc 
residue. 

Sequence Analysis ofRfbB 0:S4 —RfbB 0:54 is predicted to be a 
53.3-kDa protein composed of 459 amino acids, with a calcu- 
lated pi of 8.32. Hydropathic analysis predicted four trans- 
membrane helices between residues 11-40, 325-340, 385-406, 
and 416-438. This protein is therefore expected to be an inte- 
gral membrane protein. Based on the relative distribution of 
positive amino acid residues, the hydrophilic region defined by 
residues 340 and 384 is predicted to lie in the periplasm (6% 
Arg + Lys). This location was confirmed by construction of an 
in-frame PhoA-positive fusion (pWQ800B22) at amino acid 368 
(Fig. IB). 

The demonstrated activity of RfbAo :64 combined with the 
observation that RfbAo :54 and RfbB 0:54 together are sufficient 
for synthesis of polymeric 0:54 LPS suggests that RfbB 0:64 is a 
processive 0-glycosyltransferase (EC 2.4.1 JO. Searches of the 
data bases with the RfbB 0:64 sequence identified only one pro- 
tein, IcaA (44), with significant sequence homology over its 
entire length (22%). IcaA is a putative glycosyltransferase from 
the ica (intercellular adhesion) locus of Staphylococcus epider- 
midis. This protein is believed to play a role in the synthesis of 
the linear 0-1,6-linked glucosaminoglycan involved in intercel- 
lular adhesion. The predicted size and topology of this protein 
resembles that of RfbB 0:54 . A number of other proteins were 
identified that possess regions of conservation with the large 
central hydrophilic domain of RfbB 0:54 (Table II). Most of these 
proteins are glycosyltransferases involved in the synthesis of 
bacterial cell surface polysaccharides. Two of these proteins, 
Cps3S and HasA, are also predicted to be integral membrane 
proteins with similar hydropathy plots and four predicted 
transmembrane domains. Cps3S is a glycosyltransferase that 
directs the synthesis of the type 3 capsule of Streptococcus 
pneumoniae (45, 46). The enzyme is Afunctional and proces- 
sive, catalyzing the formation of [-*3)-0-d-G1cA-(1-»4)-0-d-G1c- 
(1->] W (45). HasA is the hyaluronic acid synthase from Group A 
Streptococcus pyogenes (47, 48). This protein is also a bifunc- 
tional processive 0-glycosyltransferase, catalyzing the forma- 
tion of a polysaccharide with the structure [-*4)-0-d-GIcA- 
d^-p-D-Glc-U-*],, (47). 

Using HCA, Saxena et ah (36) compared the plots of HasA 
with a number of known processive ^-glycosyltransferases. 
They reported a correlation between the presence of two con- 
served structural regions (domains A and B) and the shared 
catalytic activity. Domain A is common to both the ExoU and 
HasA families of 0-glycosyltransferases; whereas domain B, 
located a short distance downstream of domain A, is unique to 
the HasA family. A single conserved Asp residue in region II 
and a conserved sequence motif (QXXRW) in region IV were 
both reported to characterize domain B. Comparison of the 
HCA plots of the large hydrophilic domains of RfbB 0:64 , Cps3S, 
and IcaA, with that of HasA, confirmed the presence of both 
domains in these proteins (Fig. 7). All three proteins possessed 
the conserved Asp and (QXXRW) motif of domain B and with 
the exception of Cps3S, the two conserved Asp residues char- 
acteristic of domain A were also present in RfbB 0:64 and IcaA 
(Fig. 7). Although the Asp-04 is conserved in Cps3S, there is no 
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Table II 

Members of the HasA family of processive fi-glycosyltransferases. 



Protein 


■ _ 
Putative or known function 


Organism 


Accession 
no. 


Reference 


RfbB 0:54 


Polymerizes 0:54 O-polysaccharide; transfers ManNAc to ManNAc in 


S. enterica sv. Borreze 


L39794 


This study 




alternating 01— *3 and 01— *4 linkages 








ueuuDiuronic acia syntnase; syntnesizes l— >oj-p-D-LrlcAK l~^4)-p-D-Lrlc-( 1— 


S. pneumoniae 


Z47210 


46 








U15171 


45 


LCOA 


Involved in intercellular adhesion 


S. epidermidis 


U43366 


44 


fimsrt 


Membrane protein involved in autoagglutination 


Y. pestis 


U22837 


Unpublished 




Polymerizes Nod factor oligosaccharide; transfers GlcNAc 01— »4 to GlcNAc 


Rhizobium loti 


X52958 


80 


NodC 


Nod factor oligosaccharide synthesis 


It galegae 


X87578 


Unpublished 


NodC 


Nod factor oligosaccharide synthesis 


R, meliloti 


X01649 


81 








Ml 1268 


oz 


NodC 


Nod factor oligosaccharide synthesis 


R. leguminosarum 


Y00548 


83 


NodC 


Nod factor oligosaccharide synthesis 


R. leguminosarum bv. phaseoli M58626 


84 


NodC 


Nod factor oligosaccharide synthesis 


Azorhizobium caulinodans 


L18897 


85,86 


FbfA 


Involved in fruiting body morphogenesis; related to NodC proteins 


Stigmatella aurantiaca 


Z11601 


Unpublished 


Chs2 


Chitin synthase; transfers GlcNAC 0(l->4) to GlcNAc 


Candida albicans 


M82937 


65 


Call 


Chitin synthase 


S. cerevisiae 


X57300 


68 


Chs3 


Chitin synthase 


C. albicans 


D13454 


67 


Csd2 


Chitin synthase 


S. cerevisiae 


M73697 


64 


ChsE 


Chitin synthase 


Emericella nidulans 


U52362 


Unpublished 


Chs-4 


Chitin synthase 


N. crassa 


U25097 


Unpublished 


CelA 


Cellulose synthase; transfers Glc 01-+4 to Glc 


Agrobacterium tumefaciens 


L38609 


60 


HasA* 


Hyaluronic acid synthase; synthesizes [->4)-0-GlcA-<l->3)-0-GlcNAc-(l->l n 


Streptococcus pyogenes 


L21187 


47,48 


Alg8" 


Involved in alginate synthesis; exact function unknown 


Pseudomonas aeruginosa 


L22611 


62 


Chsl" 


Chitin synthase 


S. cerevisiae 


M14045 


63 


AcsAB" 


Cellulose synthase 


Acetobacter xylinum 


X54676 


53,59 


Dg42° 


Function unknown 


Xenopus laevis 


M22249 


87 



* Original member of the HasA family, as defined by Saxena et al (36). 



Asp immediately next to the 02 sheet of the protein; however, 
an Asp residue is located in the middle of a2 (Fig. 8). A search 
of the data bases identified a total of 17 other proteins, in 
addition to the 6 originally identified by Saxena et al., that 
possess the conserved features of the HasA family (Table II). 

DISCUSSION 

Plasmid pWQ799 carries the entire 0:54 biosynthetic cluster 
and consists of three genes, r/bAo :54 , rfbB 0:64> and rfbC Q:64 . 
Together these genes direct the synthesis of a unique Salmo- 
nella O-polysaccharide. The 0:54 polysaccharide is the only 
known homopolymeric O-antigen in Salmonella (49), and while 
the presence of N-acetylated amino sugars is not an unusual 
feature for Salmonella O-polysaccharides, ManNAc is found 
only in 0:54. It is therefore not surprising that the predicted 
products of rfb 0:64 showed no homology to other Salmonella 
protein sequences in the data bases. 

RfbAo.^ is an N-acetylmannosaminyltransferase. This en- 
zyme transfers the first ManNAc residue to undecaprenolpyro- 
phosphoryl-GlcNAc, the product of the Rfe reaction. The in- 
volvement of Rfe (UDP-GlcpNAc::undecaprenolphosphate 
GlcpNAc-l-phosphate transferase) in the initiation of 0:54 bio- 
synthesis resembles the situation observed with other homopo- 
lysaccharide O-antigens (7, 10, 12, 50), where biosynthesis is 
initiated by Rfe but GlcNAc does not form part of the O-unit. It 
is unclear if this type of initiation reaction is necessarily lim- 
ited to O-polysaccharides. RfbAo^ shares significant sequence 
homology with a number of glycosyltransferases in the data 
bases. Two of the proteins, ExoO and ExoU from R. meliloti, are 
0-glycosyl transferases that catalyze 0-1,6 linkages from 
or-linked nucleotide diphospho sugar donors. Alignment of the 
HCA plots derived for RfbAo^ and 4 RfbAo :M -related proteins 
with the previously characterized plot of ExoU (36) revealed 
that all of the proteins possess the same N-terminal structural 
region (domain A) identified by Saxena and co-workers (36) in 
ExoU and a number of functionally related transferases. This 
domain is believed to be responsible for the nonprocessive 
0-glycosyltransferase activity of proteins within this family. 
Given their common activities and the conservation of domain 



A among the members of the ExoU family of glycosyltrans- 
ferases, it is likely that this region represents at least part of 
the catalytic domain of these enzymes. We identified two highly 
conserved motifs among the aligned HCA plots. The first (DX- 
DD),falls in the 04 region of domain A and includes a conserved 
Asp residue previously identified by Saxena and co-workers 
(36). Saxena et al. (36) also identified a second conserved Asp 
residue in domain A, in the loop at the C-terminal end of the 02 
strand of domain A. HCA alignments revealed that all of the 
proteins possess either one (RfbAo-^, orf275, Lsg6, and AmsE) 
or two Asp residues (ExoU) in this region, although the position 
is not strictly conserved and an Asp may occupy the first and/or 
the second site within this loop. 

Very little is known about anabolic glycosyl transfer reac- 
tions, but it has been proposed that, mechanistically, this type 
of reaction may be viewed as the reverse reaction of the glycosyl 
transfer reaction performed by O-glycosidases, the difference 
being that the result is the extension rather than hydrolysis of 
an oligosaccharide or polysaccharide chain (51). By analogy 
with the extensively characterized polysaccharide hydrolase 
systems (reviewed in Refs. 51, 52), this hypothesis predicts that 
formation of a 0-glycosyl linkage from an a-linked sugar nucle- 
otide donor would involve the same type of catalytic event as 
that of the inverting glycoside hydrolases. Hydrolysis of glyco- 
side bonds by inverting glycosidases results in a net inversion 
of configuration at the anomeric center of the reducing sugar 
product. The catalytic mechanism involves two acidic active 
site amino acids that act as acid-base catalysts. Among the 
cellulases and xylanases, in every instance where the catalytic 
residues have been identified, the amino acid has been either 
an aspartate or a glutamate (52). One of these residues is 
believed to act as the acid catalyst to protonate the substrate, 
whereas the other is thought to act as the base catalyst by 
deprotonating water. The two catalytic residues are located in 
flexible loop regions in the active site cleft, between substrate- 
binding subsites. Characterization of RfbAo^-related proteins 
in data bases has identified 15 new members of the ExoU 
family, in addition to the 7 originally described by Saxena et al. 
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Fig. 7. Alignment of the HCA plot of HasA with plots of RfbBo^ and related proteins. Plots were generated using the method outlined 
for Fig. 3. Conserved residues are circled. 



(53). These new members all possess two conserved Asp resi- 
dues within the putative catalytic domain (Fig. 4). The second, 
in the terminal loop of 04, falls in a more strictly conserved 
region of the proteins and is surrounded by additional acidic 
residues. In RfbAo.^, sequence and HCA plot alignments pre- 
dict that the catalytic residues are Asp-41 and either Asp-93, 
Aflp-95, or Asp-96. 

HCA also identified a conserved motif (EDY) downstream of 
domain A, in a region containing amino acid clusters with a 
shape typical of a-helices (21). Carbohydrate ligands are be- 
lieved to interact with binding subsites through a combination 
of ionic interactions, hydrogen bonding, and stacking interac- 
tions between aromatic amino acids and the hydrophobic 
patches of carbohydrate monomers (reviewed in Ref. 52). The 
conservation of the (EDY) motif and its position within a region 
of structural conservation, a short distance from a putative 
catalytic residue, suggests this motif might represent part of a 
binding subsite. Multiple sequence alignment revealed that a 
similar motif occurs in other members of this family (Fig. 4) 



although the order of the first two acidic amino acids may be 
reversed and in some cases the third amino acid may be a Lys. 
Although Tyr is an aromatic amino acid, both it and Lys are 
structurally similar, in that both have long chains and may 
participate in hydrogen bonding through either a terminal 
hydroxyl (Tyr) or amino (Lys) group. The acidic nature of Asp 
and Glu would also allow hydrogen bonding of these residues 
with a carbohydrate ligand. Given the predominance of aro- 
matic amino acids surrounding the (EDY) motif, it is possible 
that the surrounding residues participate in stacking interac- 
tions with the hydrophobic patch of the glycosyl monomer. The 
only protein that did not possess the (EDY) motif was Dpml. 
This enzyme catalyzes the transfer of mannose from GDP-man 
to dolichol phosphate (54). The fact that the ligand recognized 
by Dpml is not a carbohydrate is further support for the hy- 
pothesis that this motif is involved in substrate binding. 

RfbBo:M is predicted to be an integral membrane protein 
with four transmembrane helices, a large central hydrophilic 
domain and a periplasmic loop. Searches of the data bases 
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Catalytic Site 1 Catalytic Site 2 



Has A 94 LSEIYIVD- ftGSSNTDA 

RfbB 64 LFDIVWN gNSTDRTG 

Cps3S SI SEIIWINGPK- -NERLVKLCHSFNEKLEN 

NodC 80 PITIWVD -pOSTNKTS 

Chsl 4 52 KIWCIIS - -pfORSKINE 

Dg42 131 LKIILVIDGNTEDDAYMMEMFKeVFHGEDV 

Alg8 122 GYPTTWC IVEMS-DEVLVRSL 

IcaA 75 EKEIIIIN pQSSDNTA 

Ac SAB 180 KVRVHILD §QRRPEFA 



134 SDADVFLTVfiSDTY I YPNALEELLKSFN - -DETVYAATG- - HLNARN 
128 ENYDLVMVljADNFVDANILTELNSQWI - -SKDKPEAIQA- YLDCKN 
105 SQSDITVL\™SDTVWTPRTLSELLKPFVC-DKKIGGVTT--RQKILD 
113 TDGDL I LNVDSDTVIDKDWTKLASSMR - - APNVGGVMG - - QLVAKN 
532 LRPN I VTLL||AGTMPGKDS I YQLWREFR - - NPNVGGACG - -EI RTDL 
198 TS VD YVQVraSDTKLDELATVEMVKVLE - - SNDMYGAVGG - DVRILN 
155 DDDAWAVIDGDTVLDHGWKKTVPWFKL- FPNVGGLTT- -NEFCEV 
110 AS YE YVMCLBADTV IDDDAPFYMI EDFKK - NPKLGAVTG - -NPRIRN 
212 TDGDY I L I Fj|CDHVPTR - AFLQLTMGWMVEDP K I ALMQTPHHFYSPD 



Catalytic Site 3 Catalytic Site 4 Binding Site 

* * ** 

Has A 241 - 1 GDgRCLTN - -YAID-LO - RTVYQ - - STAR - CDTD - - VP FQLKS Y LKO^rSKNKS 

RfbB 230 - - - E§I ELE I - - E I VRKRG - RVLWNHNVRV YDEK-PDNLRIS LKQRY§WSKG 

Cps3S 213 -VSDiRSLTN - - - LTLKKQYKTVMQDTSW - - -YTDA- - PTSWKKF - - IRQQiMaeG 

NodC 223 -FGE&RHLTILMLK AGFRTGYVPSAVARTLVp|-GSPY - LROQlHaRS 

Chsl 654 YLAEgRILCFEWTKKNCNWILKYCRSSYAS TD--VPERVTEFI LR--rJ|SlnG 

D9f42 307 -IXSDgRHLTNRVLS MGYRTKYTHKSRAF Sj§ - -TPSLYLRW LNQQtHtKS 

Alg8 271 LTGDBKSSWFSLMR LGYDTFY- -VPDAAINTVHHIPPE KSFIKASRKLMY- -g^YGN 

IcaA 211 EDIAVS WK LH - LFYE IKYPRALCWMLVPET IGGLWKQRV^AQG 

ACSAB 318 - - - EMaHTALKMQR - - - LGWSTAYLR I PLAG - LATf RLI LH IGQRV^|ARG 

Fig. 8. Multiple sequence alignment of conserved structural regions in HasA, RfbBo^, and related proteins. Conserved residues are 
indicated by boldface. Residues that are 100% conserved are indicated by shaded boxes. Putative catalytic and binding subsites were identified by 
extrapolation from the inverting glycosidases. 



identified a number of proteins possessing regions of conserva- 
tion with the central hydrophilic domain. Among these, the 
proteins with known activities were all glycosyltransferases 
involved in synthesis of cell surface or secreted polysaccha- 
rides. The demonstrated activity of RfbA C:54 combined with the 
fact that rfbAB Q:M are sufficient for polymeric 0:54 LPS in an 
E. coli Arfb host suggest that RfbB Q:54 is a processive ManNAc 
transferase that acts after RfbAo :54 to polymerize the 0:54 
polysaccharide. Analysis of the sequence alignments obtained 
with RfbB 0:64 identified regions of homology with two known 
processive p-glycosyltransferases, Cps3S from S. pneumoniae 
(45, 46) and HasA from Group A S. pyogenes (47, 48). Both 
proteins have four predicted transmembrane domains with a 
large central hydrophilic region, and HCA of these hydrophilic 
domains revealed that RfbB 0:64 and Cps3S both possess the 
same multidomain structure and conserved residues previ- 
ously identified in the HasA family of processive 0-glycosyl- 
transferases recently described by Saxena etal. (53). There are 
two conserved structural regions in proteins within this family. 
The first region (domain A) is common to both the ExoU and 
HasA families, and the second (domain B) is specific to the 
HasA family. Domain B is subdivided into four regions and has 
a conserved Asp residue at the C- terminal end of region II and 
a conserved sequence motif (QXXRW) at the C-terminal end of 
region IV. A search of the data bases for proteins possessing the 
conserved features of this family of proteins identified 17 new 
members (Table II), including RfbB 0;64 , Cps3S, and IcaA. Of 
the proteins that were identified, most are known processive 
enzymes, including Cps3S; five are NodC proteins involved in 
the polymerization of the 0-1,4-linked Nod factors of Rhizobia; 
eight are chitin synthases, involved in the polymerization of 
0-1,4-linked GlcNAc polysaccharide chains of yeast chitin; and 
one is a cellulose synthase, involved in the polymerization of 
the 0-1,4-linked Glc chains of cellulose. The identification of 
RfbB 0;64 as a member of this family provides further evidence 
for the processive nature of this enzyme; however, it must be 
acknowledged that the data do not preclude the possible in- 
volvement of RfbAo :M in the elongation process. 

It is interesting to note that synthesis of the E. coli K5 
capsule also involves a processive glycosyltransferase, KfiC 
(55). In this polysaccharide, the repeating unit consists of al- 
ternating 0- and a-linkages, [^)-0-D-GlcpA-(l->4)-a-D-Glcp- 



NAc-(l-] (56). KfiC must therefore catalyze both a- and 0-link- 
ages from a-linked donor molecules. Sequence alignment 
between RfbB 0:54 and KfiC revealed that the K5 transferase 
possesses domain A but not domain B (data not shown), pro- 
viding further support for the role of each domain in the for- 
mation of 0-glycosidic bonds. 

Saxena et al. (36) have proposed a model for the processive 
mechanism of polymerization in the HasA family of proteins. 
The model accounts for the characteristic multidomain archi- 
tecture of the HasA family of proteins and, as with the mono- 
functional /3-glycosyltransferases, is based on the hypothesis 
that inverting anabolic glycosyltransferases use the same cat- 
alytic mechanism as inverting glycoside hydrolases (51). Ac- 
cording to this model, domains A and B represent different 
catalytic domains that together allow these proteins to catalyze 
two j3-glycosidic bonds, either simultaneously or sequentially. 
The subsequent loss of the two UDP groups from the catalytic 
sites is proposed to provide the driving force for the chain to 
move through the catalytic cleft until the terminal sugar inter- 
acts with the last binding subsite, allowing two more UDP 
sugars to enter. The simultaneous formation of two glycosidic 
linkages provides a simple mechanism for the generation of the 
2-fold screw axis that arises from a disaccharide repeat with 
two 0-glycosidic linkages, without invoking a concomitant ro- 
tation of either the enzyme or the substrate. It also provides an 
effective mechanism for maintaining the fidelity of a heteropo- 
lysaccharide disaccharide repeat. However, by analogy with 
the inverting glycosidases, such an activity would involve a 
total of four conserved acidic amino acid residues, two in each 
domain. Sequence and HCA plot alignments identified only one 
conserved acidic residue in domain B. Based on sequence con- 
servation in the chitin synthases, Nagahashi et al. (57) have 
also speculated that this residue is involved in catalysis. To 
further substantiate this hypothesis, these workers used site- 
specific mutagenesis to replace this conserved Asp (Asp-562) 
with the longer Glu and observed a 100% reduction in enzyme 
activity. They also replaced Asp-562 with an Asn, to determine 
whether the hydroxyl group of Asp was necessary for activity. 
Not surprisingly, a complete loss of activity was observed in the 
mutant. Because the inverting mechanism predicts two cata- 
lytic residues in domain B, the sequences of all of the known 
transferases in the HasA family were examined for additional 
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conserved amino acids. The search was limited to the region 
extending from the end of 04 to approximately 25 residues past 
the (QXXRW) motif. The C-tenninal end of the search region 
corresponds to the start of a predicted transmembrane domain 
in RfbB 0:54 , Cps3S, HasA, IcaA, and NodC. A highly conserved 
proline was identified a short distance in front of the (QXXRW) 
motif, in the junction between regions III and IV (Fig. 8). This 
Pro is predicted to lie within a loop at the C-terminal end of a 
0-sheet. In all of the proteins, an Asp or Glu was found 2-4 
residues before the conserved Pro. It is possible that the con- 
served carboxylate adjacent to this Pro represents the second 
catalytic residue of domain B. In RfbB 0:54f the catalytic resi- 
dues are therefore speculated to be Asp-92 or Asp-94, Asp- 151, 
Asp-244, and either Glu-268 or Asp-269. 

The position of the (QXXRW) motif relative to the fourth 
potential catalytic residue in these proteins suggests that the 
motif may represent part of a binding subsite. This possibility 
is supported by the predicted interactions of the residues in the 
motif with a carbohydrate ligand: hydrophobic interactions 
between the aromatic Trp residue and the hydrophobic patch of 
a glycosyl monomer, and hydrogen-bonding interactions be- 
tween the guanidinium side chain of Arg and the glycosidic 
hydroxyls. Noting that this motif was highly conserved in the 
chitin synthases, Nagahashi et al. (57) also speculated on the 
possible function of this motif in enzyme activity. To confirm a 
structure-function relationship, they used site-specific mu- 
tagenesis to individually replace each residue in the motif, and 
then measured enzyme activity and K m values in the resulting 
mutant proteins. In each case, a conservative change resulted 
in a reduction in activity and either an increase or decrease in 
binding affinity. While these results were interpreted as evi- 
dence for a role in catalysis, they could equally be interpreted 
as evidence for a role in binding of the substrate or in hydrogen- 
bonding interactions with a catalytic residue, thereby main- 
taining the correct orientation for catalysis. 

There are presently two known pathways for O-antigen bio- 
synthesis. These pathways are fundamentally different. Key 
criteria distinguishing the two are the cellular location of the 
polymerization reaction and the mode of export across the 
plasma membrane (6). In Rfc-dependent synthesis, polymeri- 
zation involves block-wise addition of single O-repeat units and 
occurs at the periplasmic face of the plasma membrane. Indi- 
vidual O-units are assembled on undecaprenol-P in the cyto- 
plasm and then transported across the plasma membrane, 
presumably by the O-unit transporter, RfhX (58). Ligation to 
lipid A core is catalyzed by RfaL. Rfc-independent O-polysac- 
charide biosynthesis is currently limited to homopolymeric O- 
poly saccharides. In this pathway, synthesis is initiated by the 
Rfe-dependent transfer of GlcNAc-l-P to undecaprenol-P. The 
complete O-polysaccharide chain is then polymerized in the 
cytoplasm prior to being delivered to the site of ligation by an 
r/b-encoded dedicated ABC transporter. At first glance, the 
pathway for 0:54 polysaccharide synthesis appears similar to 
the Rfc-independent pathway of O-antigen synthesis. In both 
cases, synthesis is initiated by Rfe and the O-repeat is a ho- 
mopolymer. In addition, by analogy with other more well-char- 
acterized HasA-related proteins, RfbB 0:64 is expected to poly- 
merize the 0:54 polysaccharide in the cytoplasm, where pools 
of activated precursor are available. It is at this point in the 
pathway that 0:54 biosynthesis diverges from the Rfc-inde- 
pendent pathway, there is no dedicated ABC transporter for 
export of the polymerized 0:54 polysaccharide. This is the only 
known O-polysaccharide system that does not encode either an 
ABC transporter or an RfhX O-unit transporter. Despite the 
absence of either of these components, smooth 0:54 LPS is 
expressed in a 6rfb E. coli K-12 host strain containing only 



rfbAo-g^ and r/6B 0:54 . We have previously shown that, in the 
absence of the cognate ABC transporter, an E. coli K-12 strain 
containing the remaining rfb genes from K pneumoniae 01 
(Rfc-independent) accumulates O-antigen in the cytoplasm 
(13). There is, therefore, no alternate, generic O-polysaccharide 
export system in E. coli. Consequently 0:54 synthesis repre- 
sents a new pathway for O-antigen assembly, involving a dif- 
ferent mechanism for delivering nascent O-polysaccharide to 
the LPS O-antigen ligase. 

Intriguingly, no export system has yet been identified for 
polymers produced by a number of other processive /3-glycosyl- 
transferases in the HasA family. These polysaccharides include 
bacterial cellulose (Acetobacter xylinum and Agrobacterium 
tumifaciens (53, 59, 60)), hyaluronic acid (S. pyogenes (47, 48, 
61), the type 3 capsule of S. pneumoniae (45, 46), alginate (P. 
aeruginosa (62)), and chitin (S. cerevisiae, Candida albicans y 
Emericella nidulans and Neurospora crassa (63-68)). In con- 
trast, transmembrane transport of the rhizobial Nod factors 
occurs through the action of a dedicated ABC transporter (69, 
70). However, synthesis of Nod signal factor differs somewhat 
from that of cell-surface polysaccharides, as a carrier lipid has 
not yet been identified; the polymer that is synthesized is much 
shorter and is generally thought to be secreted (71); and the 
oligosaccharide product is substituted with acyl, acetyl, and 
sulfate groups in a strain-specific manner (72). The NodC pro- 
teins are all highly homologous, and their predicted topology 
differs from that of RfbB 0;64 , HasA, and Cps3S. NodC proteins 
may therefore represent a subfamily of the HasA family. The 
predicted topology of RfbB 0:54 , with a periplasmic loop follow- 
ing the cytoplasmic glycosyltransferase domain, suggests the 
possibility that the protein possesses two separate activities, 
catalyzing the polymerization of the 0:54 polysaccharide and 
coupling this with transport in a vectorial reaction, similar to 
that suggested for chitin synthases (73). The C-terminal trans- 
membrane domains would therefore be predicted to form a pore 
or channel through which the growing chain is extruded. Sim- 
ilarities in size, two-dimensional architecture, and hydropathy 
plots point to the possibility of a similar transferase/transport 
function for Cps3S, IcaA, and HasA. 

Synthesis of the 0:54 polysaccharide clearly represents a 
third and new pathway for O-antigen assembly. With the iden- 
tification of potential catalytic residues in RfbB 0:64 and the 
speculated role for the periplasmic loop in transmembrane 
transport, the 0:54 O-polysaccharide provides a relatively sim- 
ple system for testing this putative export function and for 
examining the mechanism of catalysis of /3-glycosyltrans- 
ferases. These analyses will also serve to characterize a novel 
O-polysaccharide biosynthetic pathway. 
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